Handling Large Workloads by Profiling and Clustering

نویسنده

  • Matteo Golfarelli
چکیده

View materialization is recognized to be one of the most effective ways to increase the Data Warehouse performance; nevertheless, due to the computational complexity of the techniques aimed at choosing the best set of views to be materialized, this task is mainly carried out manually when large workloads are involved. In this paper we propose a set of statistical indicators that can be used by the designer to characterize the workload of the Data Warehouse, thus driving the logical and physical optimization tasks; furthermore we propose a clustering algorithm that allows the cardinality of the workload to be reduced and uses these indicators for measuring the quality of the reduced workload. Using the reduced workload as the input to a view materialization algorithm allows large workloads to be efficiently handled.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Genetic Relationships among Three Yarrow Species Based on Phenotypic Traits and Peroxidase Profiling

Fifteen yarrow populations from different species Achillea millefolium L., A. biebersteinii L. and A. nobilis, from different geographical areas of Iran were studied using 24 morphological traits and peroxidase profiles. Comparison of mean values of different phenotypic traits show A. millefolium and A. biebersteinii L. had higher plant height and crown diameter; however, A. nobilis had higher ...

متن کامل

Expression Profiling of Microarray Gene Signatures in Acute and Chronic Myeloid Leukaemia in Human Bone Marrow

Background Classification of cancer subtypes by means of microarray signatures is becoming increasingly difficult to ignore as a potential to transform pathological diagnosis nonetheless, measurement of Indicator genes in routine practice appears to be arduous. In a preceding published study, we utilized real-time PCR measurement of Indicator genes in acute lymphoid leukaemia (ALL) and acute m...

متن کامل

OPTIMIZATION OF FUZZY CLUSTERING CRITERIA BY A HYBRID PSO AND FUZZY C-MEANS CLUSTERING ALGORITHM

This paper presents an efficient hybrid method, namely fuzzy particleswarm optimization (FPSO) and fuzzy c-means (FCM) algorithms, to solve the fuzzyclustering problem, especially for large sizes. When the problem becomes large, theFCM algorithm may result in uneven distribution of data, making it difficult to findan optimal solution in reasonable amount of time. The PSO algorithm does find ago...

متن کامل

FlexSplit: A Workload-Aware, Adaptive Load Balancing Strategy for Media Cluster

A number of technology and workload trends motivate us to consider a new request distribution and load balancing strategy for streaming media cluster. First, in emerging media workloads, a significant portion of the content is short and encoded at low bit rates. Additionally, media workloads display a strong temporal and spatial locality. This makes modern servers with gigabytes of main memory ...

متن کامل

Bluetooth protocol profiling on the Xilinx Virtex II Pro

Nowadays, there is an increasingly stronger trend to integrate a multitude of functionalities into a single device. Traditionally, this has been achieved by utilizing more powerful general-purpose processors to handle the additional workload. Since then, application-specific processors (acting as co-processors or hardware accelerators) were introduced to offload part of these workloads and to m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003